SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: Shyjan, Andrew W.

(ii) TITLE OF INVENTION: [illegible] OF TUMOR PROGRESSION

(iii) NUMBER OF SEQUENCES: 9

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Fish & Richardson P.C.

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 
(F) ZIP: 02110-2804

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:

(B) FILING DATE: 23-MAY-1997

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/623,679
(B) FILING DATE: 29-MAR-1996

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/412,431

(B) FILING DATE: 29-MAR-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Fasse, J. Peter 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GGTGCTGGAG TACCTCATGG GCGGTGCCTA CCGCTGCAAC TACACTCGGA AAAGCTTCCG 
OACTCTCTAC AACAACTTGT TTGGCCCTAA GAGGGTAGAG CTCAGCAGAC ACACAGO^TC 



CTGTGCCTCC CAGAGTAACA TGTGGTTCCT TGATGTGCTT CCCCAAAAGC CCACCTGTGC 180
AGAATG 186
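
The declared LENGTH of an entry can be cross-checked against its printed rows. The following Python sketch is illustrative only and is not part of the filed listing; the helper name `bases_from_rows` is hypothetical. It discards the grouping spaces and the running position counter at the end of each row and keeps the letters, shown here on the first and last rows of SEQ ID NO: 1.

```python
import re

def bases_from_rows(rows):
    """Join listing rows into one string of bases.

    Grouping spaces and the trailing position counter on each row
    are discarded; letters are upper-cased.
    """
    joined = "".join(rows)
    # Keep letters only: digits are position counters, spaces are grouping.
    return re.sub(r"[^A-Za-z]", "", joined).upper()

# First and last rows of SEQ ID NO: 1 as printed above.
first_row = "GGTGCTGGAG TACCTCATGG GCGGTGCCTA CCGCTGCAAC TACACTCGGA AAAGCTTCCG"
last_row = "AGAATG 186"

full = len(bases_from_rows([first_row]))   # bases in one full row
tail = len(bases_from_rows([last_row]))    # bases in the final partial row
print(full, tail, 3 * full + tail)         # three full rows plus the tail
```

Three full rows plus the final partial row should add up to the declared length of 186 base pairs.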
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AAGGAGGCTA GGCTGCACCC TTCCCGCTTG CTCAGCAGCT GAGGCAGGGT CAGAAAGCAT 60 
GGATAGAGAA GACATTTTGC AAAAGGGAAT GCATCTTTGT AATTCCCAGT ACAAAAGACC 120 
CTAACAGATG TTGCTGTGGT CAGCTCACTA ACCAGCACAT CCCCCCTTTG CCGAGTGGGG 180 
CTCCCAGCAC AACAGGAGAG GACACCAAGC AGGCAGACAC GCAGTCCGGG AAATGGTCTG 240 
TCAGCAAACA CACCCAGAGC TACCCAACAG ACTCCTATGG GATTCTTGAA TTCCAGGGTG 300 

GGGGTTACTC CAATAAAGCC ATG TAC ATC CGA GTC TCC TAC GAC ACC AAG 350
Met Tyr Ile Arg Val Ser Tyr Asp Thr Lys
1 5 10

CCA GAT TCC CTG CTC CAC CTC ATG GTG AAG GAC TGG CAG CTG GAG CTC 398
Pro Asp Ser Leu Leu His Leu Met Val Lys Asp Trp Gln Leu Glu Leu
15 20 25

CCG AAG CTC TTG ATA TCT GTG CAC GGA GGC CTC CAA AGC TTC GAG ATG 446
Pro Lys Leu Leu Ile Ser Val His Gly Gly Leu Gln Ser Phe Glu Met
30 35 40

CAG TCC AAA CTG AAG CAG GTG TTT GGG AAA GGT CTG ATC AAG GCT GCC 494
Gln Ser Lys Leu Lys Gln Val Phe Gly Lys Gly Leu Ile Lys Ala Ala
45 50 55

ATG ACC ACG GGG GCG TGG ATC TTC ACC GGG GGT GTG AGC ACT GGT GTC 542
Met Thr Thr Gly Ala Trp Ile Phe Thr Gly Gly Val Ser Thr Gly Val
60 65 70

GTC AGC CAT GTG GGG GAT GCC TTG AAA GAC CAC TCC TCC AAG TCC AGA 590
Val Ser His Val Gly Asp Ala Leu Lys Asp His Ser Ser Lys Ser Arg
75 80 85 90

GGC CGG CTC TGT GCT ATA GGA ATT GCT CCC TGG GGC ATG GTG GAG AAC 638
Gly Arg Leu Cys Ala Ile Gly Ile Ala Pro Trp Gly Met Val Glu Asn
95 100 105

AAG GAA GAC CTG ATT GGA AAA GAT GTA ACA AGA GTC TAT CAG ACC ATG 686
Lys Glu Asp Leu Ile Gly Lys Asp Val Thr Arg Val Tyr Gln Thr Met
110 115 120

TCC AAC CCT CTG AGC AAG CTC TCT GTG CTC AAC AAT TCC CAC ACT CAC 734
Ser Asn Pro Leu Ser Lys Leu Ser Val Leu Asn Asn Ser His Thr His
125 130 135

TTC ATC TTG GCT GAC AAC GGC ACC CTG GGC AAG TAT GGT GCT GAG GTG 782
Phe Ile Leu Ala Asp Asn Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val
140 145 150

AAG CTT CGA AGA CAG CTG GAA AAA CAC ATC TCC CTG CAG AAG ATC AAC 830
Lys Leu Arg Arg Gln Leu Glu Lys His Ile Ser Leu Gln Lys Ile Asn
155 160 165 170

ACA AGG CTG GGC CAG GGT GTA CCT GTC GTG GGC CTA GTG GTA GAA GGT 878
Thr Arg Leu Gly Gln Gly Val Pro Val Val Gly Leu Val Val Glu Gly
175 180 185

GGT CCT AAC GTG GTT TCT ATC GTC CTG GAG TAT CTC AAA GAA GAC CCT 926
Gly Pro Asn Val Val Ser Ile Val Leu Glu Tyr Leu Lys Glu Asp Pro
190 195 200

CCT GTC CCT GTG GTG GTT TGC GAT GGC AGT GGA CGT GCC TCT GAC ATT 974
Pro Val Pro Val Val Val Cys Asp Gly Ser Gly Arg Ala Ser Asp Ile
205 210 215

TTG TCC TTC GCA CAC AAA TAC TGC GAC GAA GGA GGA GTC ATA AAC GAG 1022
Leu Ser Phe Ala His Lys Tyr Cys Asp Glu Gly Gly Val Ile Asn Glu
220 225 230

TCC CTG CGG GAC CAG CTT CTA GTT ACC ATT CAG AAA ACA TTT AAT TAC 1070
Ser Leu Arg Asp Gln Leu Leu Val Thr Ile Gln Lys Thr Phe Asn Tyr
235 240 245 250

AGC AAG TCC CAG TCG TAT CAG CTG TTT GCA ATT ATC ATG GAG TGC ATG 1118
Ser Lys Ser Gln Ser Tyr Gln Leu Phe Ala Ile Ile Met Glu Cys Met
255 260 265

AAG AAG AAA GAA CTC GTC ACT GTG TTT CGG ATG GGT TCC GAG GGT CAG 1166
Lys Lys Lys Glu Leu Val Thr Val Phe Arg Met Gly Ser Glu Gly Gln
270 275 280

CAA GAT GTC GAG ATG GCA ATT TTA ACT GCC TTG CTC AAA GGA ACC AAC 1214
Gln Asp Val Glu Met Ala Ile Leu Thr Ala Leu Leu Lys Gly Thr Asn
285 290 295

GCA TCA GCT CCA GAT CAG CTG AGC TTG GCC CTG GCT TGG AAC CGG GTC 1262
Ala Ser Ala Pro Asp Gln Leu Ser Leu Ala Leu Ala Trp Asn Arg Val
300 305 310

GAC ATA GCG CGA AGC CAG ATC TTC GTC TTT GGC CCA CAC TGG CCG CCA 1310
Asp Ile Ala Arg Ser Gln Ile Phe Val Phe Gly Pro His Trp Pro Pro
315 320 325 330

CTG GGA AGC CTG GCC CCT CCT GTG GAC ACC AAA GCC GCA GAG AAG GAA 1358
Leu Gly Ser Leu Ala Pro Pro Val Asp Thr Lys Ala Ala Glu Lys Glu
335 340 345

AAG AAG CCA CCC ACA GCC ACC ACC AAG GGG AGA GGA AAA GGA AAA GGC 1406
Lys Lys Pro Pro Thr Ala Thr Thr Lys Gly Arg Gly Lys Gly Lys Gly
350 355 360

AAG AAG AAA GGC AAA GTG AAA GAG GAA GTG GAG GAA GAG ACG GAC CCC 1454
Lys Lys Lys Gly Lys Val Lys Glu Glu Val Glu Glu Glu Thr Asp Pro
365 370 375

CGG AAG CTT GAG CTG CTC AAC TGG GTG AAT GCC CTG GAG CAA GCC ATG 1502
Arg Lys Leu Glu Leu Leu Asn Trp Val Asn Ala Leu Glu Gln Ala Met
380 385 390

CTG GAT GCT CTT GTC CTA GAT CGG GTG GAC TTT GTA AAG CTC CTG ATT 1550
Leu Asp Ala Leu Val Leu Asp Arg Val Asp Phe Val Lys Leu Leu Ile
395 400 405 410



[nucleotides 1551-1934 (residues 411-538): rows illegible]

CCA CCT CTG CCC T GACACCCAGT GCAGGGCCTC CTAGCTTTCA CATGCAGCCA 1987
Pro Pro Leu Pro
540

TTCACATCGC CTCTCAAGAC TGGGCCAGGC AGTGCAACCT GTCAAGCATG TCTGTCCTCC 2047
CCTCCTTCCT ACAATAGCCC CCCCTCTGGG CCCCATGCCT CTGCTCTCTC AGCCCGTTCT 2107
CCTCCCCACT GATCACTGGC GCTCCTGTTG TCTTCCAAGG CAAGGAACAA GGAAAAGCAT 2167
CTTTTTGCCC ACAAAAGTTT AGGGCTCCCC GCTGTTCAAC CATAGCCAAC CTCACTGTAC 2227
ATCGGAGTCA TCCAGGCCAG CTGCCACACA CAAGCCTTCC CCACCCTATC CCAATAGACC 2287
CTATTCCTCC ATCAAAATCA AAGCTAACTC CTGGCCTGCC ACATTGCTTC TTCTTGCTCC 2347
AGCCTGTTAA ACCTCCAATA AATGTCAGAT CTGTGGGAAG CCTTCCTCAC TCTCACTCCA 2407
CAGTTTGTAC AGAGAGCGAG AGCCTCGTTT GGTTCTACTT ACAAGGAAGG CTTTGTGTCT 2467
GTCTGTCCTT CCCAACTGAC TTCTGTTGAC AGAAGCAGTT TCCACATGAA AGCGTTGACT 2527
CACCTGGATG TTGTCATTAA TTAATAGTGA TACAAAATAT TGACACTTCT TTTCCTGCTT 2587
CTTTGTTATG CAGCCGAAAG CACTTAAGCT TCTGGGAATG GAAGTAAGTA GGACATGTTT 2647
GTGGCAGTTT ATTTACTATA TATACCTTTG TCATTCTGTG GAAGCAAAAA TTGCAATGTT 2707
TTCCATGAAT AAAGCTCGTG CC 2729
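
The paired rows of SEQ ID NO: 2 (codons above, three-letter residues below) follow the standard genetic code, so a residue row can be re-derived from its codon row. The sketch below is illustrative and not part of the filed listing; the names `CODON_TO_ONE`, `ONE_TO_THREE` and `translate_row` are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_ONE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",  # stop codon
}

def translate_row(codon_row):
    """Translate a row of space-separated codons into three-letter residues."""
    codons = codon_row.upper().split()
    return " ".join(ONE_TO_THREE[CODON_TO_ONE[c]] for c in codons)

# First ten codons of the coding region of SEQ ID NO: 2.
print(translate_row("ATG TAC ATC CGA GTC TCC TAC GAC ACC AAG"))
```

Because each residue line is determined by the codon line above it, a mismatch between the two usually points to a transcription error in one of them.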



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 542 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Tyr Ile Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His
1 5 10 15

Leu Met Val Lys Asp Trp Gln Leu Glu Leu Pro Lys Leu Leu Ile Ser
20 25 30

Val His Gly Gly Leu Gln Ser Phe Glu Met Gln Ser Lys Leu Lys Gln
35 40 45

Val Phe Gly Lys Gly Leu Ile Lys Ala Ala Met Thr Thr Gly Ala Trp
50 55 60

Ile Phe Thr Gly Gly Val Ser Thr Gly Val Val Ser His Val Gly Asp
65 70 75 80

Ala Leu Lys Asp His Ser Ser Lys Ser Arg Gly Arg Leu Cys Ala Ile
85 90 95

Gly Ile Ala Pro Trp Gly Met Val Glu Asn Lys Glu Asp Leu Ile Gly
100 105 110

Lys Asp Val Thr Arg Val Tyr Gln Thr Met Ser Asn Pro Leu Ser Lys
115 120 125

Leu Ser Val Leu Asn Asn Ser His Thr His Phe Ile Leu Ala Asp Asn
130 135 140

Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Gln Leu
145 150 155 160

Glu Lys His Ile Ser Leu Gln Lys Ile Asn Thr Arg Leu Gly Gln Gly
165 170 175

Val Pro Val Val Gly Leu Val Val Glu Gly Gly Pro Asn Val Val Ser
180 185 190

Ile Val Leu Glu Tyr Leu Lys Glu Asp Pro Pro Val Pro Val Val Val
195 200 205

Cys Asp Gly Ser Gly Arg Ala Ser Asp Ile Leu Ser Phe Ala His Lys
210 215 220

Tyr Cys Asp Glu Gly Gly Val Ile Asn Glu Ser Leu Arg Asp Gln Leu
225 230 235 240

Leu Val Thr Ile Gln Lys Thr Phe Asn Tyr Ser Lys Ser Gln Ser Tyr
245 250 255

Gln Leu Phe Ala Ile Ile Met Glu Cys Met Lys Lys Lys Glu Leu Val
260 265 270

Thr Val Phe Arg Met Gly Ser Glu Gly Gln Gln Asp Val Glu Met Ala
275 280 285



Ile Leu Thr Ala Leu Leu Lys Gly Thr Asn Ala Ser Ala Pro Asp Gln
290 295 300

Leu Ser Leu Ala Leu Ala Trp Asn Arg Val Asp Ile Ala Arg Ser Gln
305 310 315 320

Ile Phe Val Phe Gly Pro His Trp Pro Pro Leu Gly Ser Leu Ala Pro
325 330 335

Pro Val Asp Thr Lys Ala Ala Glu Lys Glu Lys Lys Pro Pro Thr Ala
340 345 350

Thr Thr Lys Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Gly Lys Val
355 360 365

Lys Glu Glu Val Glu Glu Glu Thr Asp Pro Arg Lys Leu Glu Leu Leu
370 375 380

Asn Trp Val Asn Ala Leu Glu Gln Ala Met Leu Asp Ala Leu Val Leu
385 390 395 400

Asp Arg Val Asp Phe Val Lys Leu Leu Ile Glu Asn Gly Val Asn Met
405 410 415

Gln His Phe Leu Thr Ile Pro Arg Leu Glu Glu Leu Tyr Asn Thr Arg
420 425 430

Leu Gly Pro Pro Asn Thr Leu His Leu Leu Val Arg Asp Val Lys Lys
435 440 445

Ser Asn Leu Pro Pro Asp Tyr His Ile Ser Leu Ile Asp Ile Gly Leu
450 455 460

Val Leu Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr Arg
465 470 475 480

Lys Ser Phe Arg Thr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg Val
485 490 495

Glu Leu Ser Arg His Thr Val Ser Cys Ala Ser Gln Ser Asn Met Trp
500 505 510

Phe Leu Asp Val Leu Pro Gln Lys Pro Thr Cys Ala Glu Cys Asn Ser
515 520 525

Ser Pro His Leu Ser Gln Thr Asp Ile Thr Pro Pro Leu Pro
530 535 540

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GGGGAAGCAC ATCAAGGAAC 
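
How a short single-stranded entry such as SEQ ID NO: 4 relates to a longer entry can be examined by reverse-complementing it and searching for the result. The sketch below is illustrative only; the listing itself does not state any relationship between the two entries, and the function name `reverse_complement` is hypothetical. It compares SEQ ID NO: 4 with the third printed row of SEQ ID NO: 1 (bases 121 to 180).

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

# SEQ ID NO: 4 with its grouping space removed.
seq4 = "GGGGAAGCACATCAAGGAAC"

# Third row of SEQ ID NO: 1 (bases 121-180), grouping spaces removed.
seq1_row3 = (
    "CTGTGCCTCC" "CAGAGTAACA" "TGTGGTTCCT"
    "TGATGTGCTT" "CCCCAAAAGC" "CCACCTGTGC"
)

rc = reverse_complement(seq4)
print(rc)               # reverse complement of SEQ ID NO: 4
print(rc in seq1_row3)  # whether it occurs within that row
```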
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCAACTACTA CACTCGGAAA AGC

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 346.. 4837 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
ACTCATTATA GGGNTCGAGC GGCCGCCCGG GCAGGTTTGA GCTGTGCCCT CTCCATTCCA 
CTGCTGTGGC AGGGTCAGAA ATCTTGGATA GAGAAAACCT TTTGCAAACG GGAATGTATC 
TTTGTAATTC CTAGCACGAA AGACTCTAAC AGGTGTTGCT GTGGCCAGTT CACCAACCAG 
CATATCCCCC CTCTGCCAAG TGCAACACCC AGCAAAAATG AAGAGGAAAG CAAACAGGTG 
GAGACTCAGC CTGAGAAATG GTCTGTTGCC AAGCACACCC AGAGCTACCC AACAGATTCC 
TATGGAGTTC TTGAATTCCA GGGTGGCGGA TATTCCAATA AAGCC ATG TAT ATC 

1 

20 25 
70 

mr.r^ nn^ OOT CGG GTT TGT GCT ATA GGA ATT GCT 

r? r? ^ ?s cy, II >=i,iie". 

85 



CCA TGG GGC ATC GTG GAG AAT AAG GAA GAC CTG GTT GGA AAG GAT GTA 690
Pro Trp Gly Ile Val Glu Asn Lys Glu Asp Leu Val Gly Lys Asp Val
100 105 110 115

ACA AGA GTG TAC CAG ACC ATG TCC AAC CCT CTA AGT AAG CTC TCT GTG 738
Thr Arg Val Tyr Gln Thr Met Ser Asn Pro Leu Ser Lys Leu Ser Val
120 125 130

CTC AAC AAC TCC CAC ACC CAC TTC ATC CTG GCT GAC AAT GGC ACC CTG 786
Leu Asn Asn Ser His Thr His Phe Ile Leu Ala Asp Asn Gly Thr Leu
135 140 145

GGC AAG TAT GGC GCC GAG GTG AAG CTG CGA AGG CTG CTG GAA AAG CAC 834
Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Leu Leu Glu Lys His
150 155 160

ATC TCC CTC CAG AAG ATC AAC ACA AGA CTG GGG CAG GGC GTG CCC CTC 882
Ile Ser Leu Gln Lys Ile Asn Thr Arg Leu Gly Gln Gly Val Pro Leu
165 170 175

GTG GGT CTC GTG GTG GAG GGG GGC CCT AAC GTG GTG TCC ATC GTC TTG 930
Val Gly Leu Val Val Glu Gly Gly Pro Asn Val Val Ser Ile Val Leu
180 185 190 195

GAA TAC CTG CAA GAA GAG CCT CCC ATC CCT GTG GTG ATT TGT GAT GGC 978
Glu Tyr Leu Gln Glu Glu Pro Pro Ile Pro Val Val Ile Cys Asp Gly
200 205 210

AGC GGA CGT GCC TCG GAC ATC CTG TCC TTT GCG CAC AAG TAC TGT GAA 1026
Ser Gly Arg Ala Ser Asp Ile Leu Ser Phe Ala His Lys Tyr Cys Glu
215 220 225

GAA GGC GGA ATA ATA AAT GAG TCC CTC AGG GAG CAG CTT CTA GTT ACC 1074
Glu Gly Gly Ile Ile Asn Glu Ser Leu Arg Glu Gln Leu Leu Val Thr
230 235 240

ATT CAG AAA ACA TTT AAT TAT AAT AAG GCA CAA TCA CAT CAG CTG TTT 1122
Ile Gln Lys Thr Phe Asn Tyr Asn Lys Ala Gln Ser His Gln Leu Phe
245 250 255

GCA ATT ATA ATG GAG TGC ATG AAG AAG AAA GAA CTC GTC ACT GTG TTC 1170
Ala Ile Ile Met Glu Cys Met Lys Lys Lys Glu Leu Val Thr Val Phe
260 265 270 275

AGA ATG GGT TCT GAG GGC CAG CAG GAC ATC GAG ATG GCA ATT TTA ACT 1218
Arg Met Gly Ser Glu Gly Gln Gln Asp Ile Glu Met Ala Ile Leu Thr
280 285 290

GCC CTG CTG AAA GGA ACA AAC GTA TCT GCT CCA GAT CAG CTG AGC TTG 1266
Ala Leu Leu Lys Gly Thr Asn Val Ser Ala Pro Asp Gln Leu Ser Leu
295 300 305

GCA CTG GCT TGG AAC CGC GTG GAC ATA GCA CGA AGC CAG ATC TTT GTC 1314
Ala Leu Ala Trp Asn Arg Val Asp Ile Ala Arg Ser Gln Ile Phe Val
310 315 320

TTT GGG CCC CAC TGG ACG CCC CTG GGA AGC CTG GCA CCC CCG ACG GAC 1362
Phe Gly Pro His Trp Thr Pro Leu Gly Ser Leu Ala Pro Pro Thr Asp
325 330 335

AGC AAA GCC ACG GAG AAG GAG AAG AAG CCA CCC ATG GCC ACC ACC AAG 1410
Ser Lys Ala Thr Glu Lys Glu Lys Lys Pro Pro Met Ala Thr Thr Lys
340 345 350 355



GGA GGA AGA GGA AAA GGG AAA GGC AAG AAG AAA GGG AAA GTG AAA GAG 1458
Gly Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Gly Lys Val Lys Glu
360 365 370

GAA GTG GAG GAA GAA ACT GAC CCC CGG AAG ATA GAG CTG CTG AAC TGG 1506
Glu Val Glu Glu Glu Thr Asp Pro Arg Lys Ile Glu Leu Leu Asn Trp
375 380 385

GTG AAT GCT TTG GAG CAA GCG ATG CTA GAT GCT TTA GTC TTA GAT CGT 1554
Val Asn Ala Leu Glu Gln Ala Met Leu Asp Ala Leu Val Leu Asp Arg
390 395 400

GTC GAC TTT GTG AAG CTC CTG ATT GAA AAC GGA GTG AAC ATG CAA CAC 1602
Val Asp Phe Val Lys Leu Leu Ile Glu Asn Gly Val Asn Met Gln His
405 410 415

TTT CTG ACC ATT CCG AGG CTG GAG GAG CTT TAT AAC ACA AGA CTG GGT 1650
Phe Leu Thr Ile Pro Arg Leu Glu Glu Leu Tyr Asn Thr Arg Leu Gly
420 425 430 435

CCA CCA AAC ACA CTT CAT CTG CTG GTG AGG GAT GTG AAA AAG AGC AAC 1698
Pro Pro Asn Thr Leu His Leu Leu Val Arg Asp Val Lys Lys Ser Asn
440 445 450

CTT CCG CCT GAT TAC CAC ATC AGC CTC ATA GAC ATC GGG CTC GTG CTG 1746
Leu Pro Pro Asp Tyr His Ile Ser Leu Ile Asp Ile Gly Leu Val Leu
455 460 465

GAG TAC CTC ATG GGA GGA GCC TAC CGC TGC AAC TAC ACT CGG AAA AAC 1794
Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr Arg Lys Asn
470 475 480

TTT CGG ACC CTT TAC AAC AAC TTG TTT GGA CCA AAG AGG CCT AAA GCT 1842
Phe Arg Thr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg Pro Lys Ala
485 490 495

CTT AAA CTT CTG GGA ATG GAA GAT GAT GAG CCT CCA GCT AAA GGG AAG 1890
Leu Lys Leu Leu Gly Met Glu Asp Asp Glu Pro Pro Ala Lys Gly Lys
500 505 510 515

AAA AAA AAA AAA AAG AAA AAG GAG GAA GAG ATC GAC ATT GAT GTG GAC 1938
Lys Lys Lys Lys Lys Lys Lys Glu Glu Glu Ile Asp Ile Asp Val Asp
520 525 530

GAC CCT GCC GTG AGT CGG TTC CAG TAT CCC TTC CAC GAG CTG ATG GTG 1986
Asp Pro Ala Val Ser Arg Phe Gln Tyr Pro Phe His Glu Leu Met Val
535 540 545

TGG GCA GTG CTG ATG AAA CGC CAG AAA ATG GCA GTG TTC CTC TGG CAG 2034
Trp Ala Val Leu Met Lys Arg Gln Lys Met Ala Val Phe Leu Trp Gln
550 555 560

CGA GGG GAA GAG AGC ATG GCC AAG GCC CTG GTG GCC TGC AAG CTC TAC 2082
Arg Gly Glu Glu Ser Met Ala Lys Ala Leu Val Ala Cys Lys Leu Tyr
565 570 575

AAG GCC ATG GCC CAC GAG TCC TCC GAG AGT GAT CTG GTG GAT GAC ATC 2130
Lys Ala Met Ala His Glu Ser Ser Glu Ser Asp Leu Val Asp Asp Ile
580 585 590 595

TCC CAG GAC TTG GAT AAC AAT TCC AAA GAC TTC GGC CAG CTT GCT TTG 2178
Ser Gln Asp Leu Asp Asn Asn Ser Lys Asp Phe Gly Gln Leu Ala Leu
600 605 610



™., a«n» ^ivo rar TCC TAT AAG CAT OAC GAG CAG ATC OCT ATG AAA 
GAG TTA TTA GAC CAG TCC TAT j^y^ 

Glu Leu Leu Asp Gin ser ryr uyo t- 

615 

630 ^35 

645 

CM ATO CTG CT» see 0»T MO «0 0G» MA C« coo M= ^ 
Gin Met Leu Leu Thr Asp Met Trp Met Giy Arg i.eu y ^ 
660 665 

„C CCC OOC CTO »0 OTT ATC AXO coo ATT 2; - S ? U 
Asn Pro Gly Leu Lys Val lie Met oiy xxe u 
680 " 

s ?re S5 ?s jj^aS ?^ |; ?s IS 



710 ^-^-^ 
725 "^^^ 



775 780 

790 

805 

820 ®25 

840 '^'^^ 

ATO ATT OOA AAO ATO ATO ATC »= ATO CTO »C TTT OTO OTC ATC ATO 

M t lie Gly Lys M t M t II Asp w ^ ^ Jf 



2226 



2274 



2322 



2370 



2418 



2466 



2514 



2562 



2610 



2658 



2706 



2754 



2802 



2850 



2898 



2946 



870 

^ ^^r, nnr rcT TGG AAA CTG GCC CGA AAC ATC TTC TAC ATG 

^ ^ S? ^ Leu AU «, A.n lie Ph. Tyr Met 

885 

900 

920 

935 

««v- «/v TTC TAT CTA CTG OTC GCC AAC ATC CTG CTG GTG AAC 
2S S ™ S S| val »1. ».« lie «u l«u val »„ 

950 

ss ?r. SI ?i? Ji; ?s ?s JTe 1^ ?s 

965 ^■'O 
TCC AAC CAG GTG TGG AAG TTC CAG CGA TAT CAG CTG ATT ATG ACA TTT 
Ser i^S Sn Val Trp Lys Phe Gin Arg Tyr Gin Leu lie Met Thr Phe 

980 

SI Z ?2 £2S ^ ?f» iS S K 



1015 1020 
1030 -^^^^ 



1045 



2? ^ ?S ISJe? ^ aT,^ 

1060 1065 10^0 

Sf, ?S ?S £SS i?? M =^ - io^ 

is Jii M ^ K s IS ?s 

sr. fl S,* M M S f I J^^iT? ?I? Jil 



1110 



CAG GCA 
Gin Ala 



CGG CAA 
Arg Gin 
1155 

CAT TTT 
His Phe 
1170 



OCT CTT 0« «T CTI GOO ATC «C J« TCT GAC »^ 
Ala Leu Glu Asn Leu Ala Gly He Asp Arg » 
1125 

1140 

- ^ iis £?f. ?s £ns z s vii 

1175 J-J-®^ 

liiva ACC TGT TCC TTC CGT ATA AAG GAA GAG 
X5 ?S ^ xS - Phe ne OU Olu 
1190 

1205 1210 
1220 1225 

0C» OTA GAT GAC TTA »« SS g" oiS S5 S Z 

Ala Val Asp ASP Leu Lys Asn Ala Glu ser i^ys ^^^^ 
1240 ^^^^ 



ATT GGG ATT TCA AAG GAA GAT GAT GAA AGA CAG ACA GAC TCT AAA AAA 4146
Ile Gly Ile Ser Lys Glu Asp Asp Glu Arg Gln Thr Asp Ser Lys Lys
1255 1260 1265

GAA GAA ACT ATT TCC CCA AGT TTA AAT AAA ACA GAT GTG ATA CAT GGA 4194
Glu Glu Thr Ile Ser Pro Ser Leu Asn Lys Thr Asp Val Ile His Gly
1270 1275 1280

CAG GAC AAA TCA GAT GTT CAA AAC ACT CAG CTA ACA GTG GAA ACG ACA 4242
Gln Asp Lys Ser Asp Val Gln Asn Thr Gln Leu Thr Val Glu Thr Thr
1285 1290 1295

AAT ATA GAA GGC ACT ATT TCC TAT CCC CTG GAA GAA ACC AAA ATT ACA 4290
Asn Ile Glu Gly Thr Ile Ser Tyr Pro Leu Glu Glu Thr Lys Ile Thr
1300 1305 1310 1315

CGC TAT TTC CCC GAT GAA ACG ATC AAT GCT TGT AAA ACA ATG AAG TCC 4338
Arg Tyr Phe Pro Asp Glu Thr Ile Asn Ala Cys Lys Thr Met Lys Ser
1320 1325 1330

AGA AGC TTC GTC TAT TCC CGG GGA AGA AAG CTG GTC GGT GGG GTT AAC 4386
Arg Ser Phe Val Tyr Ser Arg Gly Arg Lys Leu Val Gly Gly Val Asn
1335 1340 1345

CAG GAT GTA GAG TAC AGT TCA ATC ACG GAC CAG CAA TTG ACG ACG GAA 4434
Gln Asp Val Glu Tyr Ser Ser Ile Thr Asp Gln Gln Leu Thr Thr Glu
1350 1355 1360

TGG CAA TGC CAA GTT CAA AAG ATC ACG CGC TCT CAT AGC ACA GAT ATT 4482
Trp Gln Cys Gln Val Gln Lys Ile Thr Arg Ser His Ser Thr Asp Ile
1365 1370 1375

CCT TAC ATT GTG TCG GAA GCT GCA GTG CAA GCT GAG CAA AAA GAG CAG 4530
Pro Tyr Ile Val Ser Glu Ala Ala Val Gln Ala Glu Gln Lys Glu Gln
1380 1385 1390 1395

TTT GCA GAT ATG CAA GAT GAA CAC CAT GTC GCT GAA GCA ATT CCT CGA 4578
Phe Ala Asp Met Gln Asp Glu His His Val Ala Glu Ala Ile Pro Arg
1400 1405 1410

ATC CCT CGC TTG TCC CTA ACC ATT ACT GAC AGA AAT GGG ATG GAA AAC 4626
Ile Pro Arg Leu Ser Leu Thr Ile Thr Asp Arg Asn Gly Met Glu Asn
1415 1420 1425

TTA CTG TCT GTG AAG CCA GAT CAA ACT TTG GGA TTC CCA TCT CTC AGG 4674
Leu Leu Ser Val Lys Pro Asp Gln Thr Leu Gly Phe Pro Ser Leu Arg
1430 1435 1440

TCA AAA AGT TTA CAT GGA CAT CCT AGG AAT GTG AAA TCC ATT CAG GGA 4722
Ser Lys Ser Leu His Gly His Pro Arg Asn Val Lys Ser Ile Gln Gly
1445 1450 1455

AAG TTA GAC AGA TCT GGA CAT GCC AGT AGT GTA AGC AGC TTA GTA ATT 4770
Lys Leu Asp Arg Ser Gly His Ala Ser Ser Val Ser Ser Leu Val Ile
1460 1465 1470 1475

GTG TCT GGA ATG ACA GCA GAA GAA AAA AAG GTT AAG AAA GAG AAA GCT 4818
Val Ser Gly Met Thr Ala Glu Glu Lys Lys Val Lys Lys Glu Lys Ala
1480 1485 1490

TCC ACA GAA ACT GAA TGC T AGTCTGTTTT GTTTCTTTAA TTTTTTTTTT 4867
Ser Thr Glu Thr Glu Cys
1495

TAACAGTCAG AACCACTAAT GGGTGTCATC TTGGCCATCC TAAACATCCA TCCAATTTCC 4927 
TAAAAACATT TTCCCTT 4944 
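
The CDS feature of SEQ ID NO: 6 (LOCATION: 346.. 4837) can be related to the residue count declared for the encoded protein. The sketch below is illustrative and not part of the filed listing; the function names `parse_location` and `complete_codons` are hypothetical, and the location is assumed to be 1-based and inclusive at both ends.

```python
def parse_location(text):
    """Parse a 'start..end' feature location; stray spaces are tolerated."""
    start, end = (int(part) for part in text.replace(" ", "").split(".."))
    return start, end

def complete_codons(start, end):
    """Return (number of complete codons, leftover bases) for an inclusive span."""
    span = end - start + 1
    return span // 3, span % 3

start, end = parse_location("346.. 4837")
codons, leftover = complete_codons(start, end)
print(codons, leftover)  # complete codons in the span, and bases left over
```

Under these assumptions the span holds 1497 complete codons, which agrees with the 1497 amino acids declared for SEQ ID NO: 7, plus one further base: the single T printed apart from the untranslated 3' text in the last coding row.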

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1497 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Tyr Ile Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His
1 5 10 15

Leu Met Val Lys Asp Trp Gln Leu Glu Leu Pro Lys Leu Leu Ile Ser
20 25 30

Val His Gly Gly Leu Gln Asn Phe Glu Met Gln Pro Lys Leu Lys Gln
35 40 45

Val Phe Gly Lys Gly Leu Ile Lys Ala Ala Met Thr Thr Gly Ala Trp
50 55 60

Ile Phe Thr Gly Gly Val Ser Thr Gly Val Ile Ser His Val Gly Asp
65 70 75 80

Ala Leu Lys Asp His Ser Ser Lys Ser Arg Gly Arg Val Cys Ala Ile
85 90 95

Gly Ile Ala Pro Trp Gly Ile Val Glu Asn Lys Glu Asp Leu Val Gly
100 105 110

Lys Asp Val Thr Arg Val Tyr Gln Thr Met Ser Asn Pro Leu Ser Lys
115 120 125

Leu Ser Val Leu Asn Asn Ser His Thr His Phe Ile Leu Ala Asp Asn
130 135 140
Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Leu Leu 



145 



Glu Lys 



150 



His He ser Leu Gin Lys He Asn Thr Arg Leu Gly Gin Gly 



165 170 

val pro Leu Val Gly Leu Val Val Glu Gly Gly Pro Asn Val Val Ser 
180 185 -^'^ 

val Leu Glu Tyr Leu Gin Glu Glu Pro Pro He Pro Val Val He 



He 



195 200 
Cys ASP Gly Ser Gly Arg Ala Ser Asp He Leu Ser Phe Ala His Lys 



210 



215 



Tyr cys Glu Glu Gly Gly He He Asn Glu Ser Leu Arg Glu Gin Leu 



225 



230 



Leu val Thr He Gin Lys Thr Phe Asn Tyr Asn Lys Ala Gin Ser His 
245 250 



Gin Leu 



Phe Ala He He Met Glu Cys Met Lys Lys Lys Glu Leu Val 



260 



265 



Thr val Phe Arg Met Gly Ser Glu Gly Gin Gin Asp He Glu Met Ala 
275 280 285 

Thr Ala Leu Leu Lys Gly Thr Asn Val Ser Ala Pro Asp Gin 



He Leu 
290 



295 



300 



Leu ser Leu Ala Leu Ala Trp Asn Arg Val Asp He Ala Arg Ser Gin 
305 310 315 

He Phe val Phe Gly Pro His Trp Thr Pro Leu Gly Ser Leu Ala Pro 
325 330 ■'•»=> 

Pro Thr ASP ser Lys Ala Thr Glu Lys Glu Lys Lys Pro Pro Met Ala 
340 345 

Thr Thr Lys Gly Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Gly Lys 
355 360 365 

val Lys Glu Glu Val Glu Glu Glu Thr Asp Pro Arg Lys He Glu Leu 

Leu Asn Trp Val Asn Ala Leu Glu Gin Ala Met Leu Asp Ala Leu Val 
385 390 395 

Leu ASP Arg Val Asp Phe Val Lys Leu Leu He Glu Asn Gly Val Asn 



405 



M t Gin His Ph Leu Thr He Pro Arg Leu Glu Glu Leu Tyr Asn Thr 



420 



425 



Arg Leu Gly Pro Pro Asn Thr Leu His Leu Leu Val Arg Asp Val Lys 

Lys Ser Asn Leu Pro Pro Asp Tyr His lie S r Leu He Asp lie Gly 

450 ^55 
Leu val Leu Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr 
465 *70 

Arg Lys Asn Phe Ar| Thr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg 
Lys Ala Leu Lys Leu Leu Gly Met Glu Asp Asp Glu Pro Pro Ala 



Pro 



Asp 



500 



Gly Lys Lys Lys Lys Lys Lys Lys Lys Glu Glu Glu He Asp He 



515 



520 



val Asp ASP pro Ala Val Ser Arg Phe Gin Tyr Pro Phe His Glu 



530 535 
Leu Met val Trp Ala Val Leu Met Lys Arg Gin Lys Met Ala Val Phe 
545 

Leu Trp Gin Arg Gly Glu Glu Ser Met Ala Lys Ala Leu Val Ala Cys 



565 



Lys Leu Tyr Lys Ala Met Ala His Glu Ser Ser Glu Ser Asp Leu Val 

580 

Asp ASP lie ser Gin Asp Leu Asp Asn Asn Ser Lys Asp Phe Gly Gin 

595 

Leu Ala Leu Glu Leu Leu Asp Gin Ser Tyr Lys His Asp Glu Gin lie 
610 

Ala Met Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ser Thr 

625 

cys Leu Lys Leu Ala Val Ala Ala Lys His Arg Asp Phe He Ala His 
^ 645 

Thr cys ser Gin Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg 

665 o/u 



660 



Met Arg Lys Asn Pro Gly Leu Lys Val He Met Gly lie Leu Leu Pro 

675 

pro Thr He Leu Phe Leu Glu Phe Arg Thr Tyr Asp Asp Phe Ser Tyr 
690 

Thr ser Lys Glu Asn Glu Asp Gly Lys Glu Lys Glu Glu Glu Asn 
710 715 

Thr ASP Ala Asn Ala Asp Ala Gly Ser Arg Lys Gly Asp Glu Glu Asn 

725 730 '^"^ 

Glu His Lys Lys Gin Arg He He Pro He Gly Thr Lys He Cys Lys 



Gin 
705 



740 



Phe Tyr Asn Ala Pro He Val Lys Phe Trp Phe Tyr Thr II S r Tyr 
755 '^^O 



Leu 61y Tyr Leu Leu Leu Phe Aan Tyr Val lie L u Val Arg M t Aso 
'70 775 

Gly Trp Pro Ser Leu Gin Glu Trp He Val He Ser Tyr He Val Ser 

790 795 

Leu Ala Leu Glu Lys He Arg Glu He Leu Met Ser Glu Pro Gly Lvs 
805 810 815 

Leu Ser Gin Lys He Lys Val Trp Leu Gin Glu Tyr Trp Asn He Thr 
820 825 830 

Asp Leu Val Ala He Ser Thr Phe Met He Gly Ala Met Ala Thr Arg 
835 840 845 

Ser Val Met Met He Gly Lys Met Met He Asp Met Leu Tyr Phe Val 
850 855 860 

Val He Met Leu Val Val Leu Met Ser Phe Gly Val Ala Arg Gin Ala 

870 875 880 

He Leu His Pro Glu Glu Lys Pro Ser Trp Lys Leu Ala Arg Asn He 
885 890 895 

Phe Tyr Met Pro Tyr Trp Met He Tyr Gly Glu Val Phe Ala Asp Gin 
900 905 910 

He Asp Leu Tyr Ala Met Glu He Asn Pro Pro Cys Gly Glu Asn Leu 
915 920 925 

o?P ^9 Leu Pro Pro Cys He Pro Gly Ala Trp 

930 935 94Q 

Leu Thr Pro Ala Leu Met Ala Cys Tyr Leu Leu Val Ala Asn He Leu 

950 955 ggQ 

Leu Val Asn Leu Leu He Ala Val Phe Asn Asn Thr Phe Phe Glu Val 
965 970 975 

Lys Ser He Ser Asn Gin Val Trp Lys Phe Gin Arg Tyr Gin Leu He 
980 985 990 

Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro Met He He Leu 
995 1000 1005 

^^"^ ^^"^ ^9 Leu ser Gly Arg Cys Arg Lys 

•"■"•lO 1015 1020 

Lys Arg Glu Gly Asp Gin Glu Glu Arg Asp Arg Gly Leu Lys Leu Phe 
■^025 1030 1035 io40 

Leu Ser Asp Glu Glu Leu Lys Arg Leu His Glu Phe Glu Glu Gin Cvs 
1045 1050 1055 

val Gin Glu His Phe Arg Glu Lys Glu Asp Glu Gin Gin Ser Ser Ser 
1060 1065 1070 

Asp Glu Arg He Arg Val Thr Ser Glu Arg Val Glu Asn Met Ser Met 
1075 1080 1085 

^'^^ Thr Phe Met Lys Thr S r Leu 

1090 1095 1100 



Gin Thr Val Asp L u Arg^Leu Ala Gin L u Glu^Glu Leu Ser Aen Arg^ 



1105 



1110 



„ t val ABH Ala Leu Olu Asn I. u Ala Gly^He Asp Arg S r Asp^Leu 



1125 



,le Gl„ Ma l>r,_Ser Ala Sar ser^olu C olu Ala Thr^Tyr .e» 



1140 



^ Gl„_ser ser He Asn S» Ala Asp 01, Tyr Sar^leu Tyr Ar, 



1155 



Tyr His Ph, AS„ Oly olu olu Leu 1^ Phe olu Asp^Thr ser Leu Ser 

1170 '^■^'^ 
Thr ser Pro Gly Thr Gly Val Arg Lys Lys Thr^Cys Ser Phe Arg Ile^ 
1185 115° 



^.ys Glu Glu Lys Asp^Val Lys Thr His Leu^Val Pro Glu Cya Gln^ABn 
ser Leu Hie Leu_Ser Leu Gly Thr Ser^Thr Ser Ala Thr Pro^Asp Gly 



1220 



ser His leu Ala val Asp Asp Lys As„ Ala olu Glu^Ser Lys l^u 



1235 



1240 



Oly pro ASP Ue ol. He Ser Lys Glu Asp Asp « Gl„ Thr Asp 

1250 ^255 
ser Lys Ly. Glu olu Thr He ser Pro Ser Leu «n Lys Thr Asp val^ 
1265 "70 

lie His Gly Gin Asp Lys Ser Asp Val Gin Asn Thr Gin Leu Thr^Val 

1285 ^^^^ 

, ^1,, fpv.^ Tie ser Tvr Pro Leu Glu Glu Thr 

Glu Thr Thr Asn He Glu Gly Thr Ser xyr ^^^^ 

1300 ^-^^^ 
Lys lie Thr.Ar, Tyr Phe Pro Asp^Glu Thr He Asn Ala^Cys Lys Thr 



1315 



M« Ly. ser A., Ser Phe V.1 Tyr ser Ar, Oly Arj^Lys Leu Val oly 

1330 

Oly val Asn Gin Asp Val Glu Tyr Ser Ser Ile^Thr Asp Gin Gin Leu^ 



1345 1350 



Thr Thr 



Glu Trp Glncys Gin Val Gin Lys^Ile Thr Arg Ser His^Ser 



1365 



Thr ASP He pro Tyr He val ser olu Ai. Ala val Oln Ala^Glu oln 

1380 

Lys Olu Oln Phe Ala Asp He. oln A.p olu Bis Hi. Val Ala olu Ala 

1395 

lie pro Arg He Pro Arg Leu^Ser Leu Thr He Thr^Asp Arg Asn Gly 



1410 1*15 



M t Glu Asn Leu Leu Serial Lys Pro Asp Gln^Thr Leu Gly Phe Pro^ 



1425 



1430 



1 5 i 



Ser Leu Arg S r Lys^Ser Leu His Gly HiB^Pro Arg Asn Val Lys^Ser 

II Gin Gly Lys Leu Asp Arg Ser Gly His Ala Ser Ser Val Ser Ser 

1460 14^5 

Leu val lie Val Ser Gly Met Thr Ala Glu Glu Lys Lys Val Lys Lys 
. mi- 1480 ±tos 



1475 



Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1490 1495 
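
For comparison with databases that use one-letter notation, the three-letter rows of SEQ ID NO: 7 can be converted mechanically. The sketch below is illustrative and not part of the filed listing; the names `THREE_TO_ONE` and `to_one_letter` are hypothetical. Tokens are case-normalised first, so a row typed as "ser" or "ASP" is still recognised.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

def to_one_letter(row):
    """Convert a row of three-letter residues to one-letter codes.

    Raises KeyError on any token that is not a standard residue name,
    which makes transcription errors easy to spot.
    """
    return "".join(THREE_TO_ONE[token.capitalize()] for token in row.split())

# First and last rows of SEQ ID NO: 7.
print(to_one_letter("Met Tyr Ile Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His"))
print(to_one_letter("Glu Lys Ala Ser Thr Glu Thr Glu Cys"))
```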

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 346.. 4945 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ACTCATTATA GGGNTCGAGC GGCCGCCCGG GCAGGTTTGA GCTGTGCCCT CTCCATTCCA 60
CTGCTGTGGC AGGGTCAGAA ATCTTGGATA GAGAAAACCT TTTGCAAACG GGAATGTATC 120
TTTGTAATTC CTAGCACGAA AGACTCTAAC AGGTGTTGCT GTGGCCAGTT CACCAACCAG 180
CATATCCCCC CTCTGCCAAG TGCAACACCC AGCAAAAATG AAGAGGAAAG CAAACAGGTG 240
GAGACTCAGC CTGAGAAATG GTCTGTTGCC AAGCACACCC AGAGCTACCC AACAGATTCC 300
TATGGAGTTC TTGAATTCCA GGGTGGCGGA TATTCCAATA AAGCC ATG TAT ATC 354
Met Tyr Ile
1

CGT GTA TCC TAT GAC ACC AAG CCA GAC TCA CTG CTC CAT CTC ATG GTG 402
Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His Leu Met Val
5 10 15

AAA GAT TGG CAG CTG GAA CTC CCC AAG CTC TTA ATA TCT GTG CAT GGA 450
Lys Asp Trp Gln Leu Glu Leu Pro Lys Leu Leu Ile Ser Val His Gly
20 25 30 35

GGC CTC CAG AAC TTT GAG ATG CAG CCC AAG CTG AAA CAA GTC TTT GGG 498
Gly Leu Gln Asn Phe Glu Met Gln Pro Lys Leu Lys Gln Val Phe Gly
40 45 50

[nucleotides 499-594: rows illegible]

GAC CAC TCC TCC AAG TCC AGA GGC CGG GTT TGT GCT ATA GGA ATT GCT 642
Asp His Ser Ser Lys Ser Arg Gly Arg Val Cys Ala Ile Gly Ile Ala
85 90 95

CCA TGG GGC ATC GT6 GAG AAT AAG GAA GAC CTG GTT GGA AAG GAT GTA 
So Trp Gly He Val Glu Asn Lys Glu Asp Leu Val Gly Lye Asp Val 
100 

era TAC CAG ACC ATG TCC AAC CCT CTA AGT AAG CTC TCT GTG 
?S J?g ?S Gin Het Ser Asn Pro Leu Ser Lys Leu Ser Val 

CTC AAC AAC TCC CAC ACC CAC TTC ATC CTG GCT GAC AAT GGC ACC CTG 
SS ser His Thr His Phe lie Leu Ala Asp Asn Gly Thr Leu 



135 



GGC AAG TAT GGC GCC GAG GTG AAG CTG CGA AGG CTG CTG GAA AAG CAC 
S5 JJs Tyr Gly Ala Glu Val Lys Leu Arg Arg Leu Leu Glu Lys Hxs 
150 155 

ATC TCC CTC CAG AAG ATC AAC ACA A6A CTG GGG CAG GGC GTG CCC CTC 
ill sS SS iys lie Asn Thr Arg Leu Gly Gin Gly Val Pro Leu 
165 170 



200 



AGC GGA CGT GCC TCG GAC ATC CTG TCC TTT GCG CAC AAG TAC TGT GAA 
sS Arg Ala Ser Asp He Leu Ser Phe Ala His Lys Tyr Cys Glu 

GAA GGC GGA ATA ATA AAT GAG TCC CTC AGG GAG CAG CTT CTA GTT ACC 
S5 ne ne Asn Glu Ser Leu Arg Glu Gin Leu Leu Val Thr 

ATT CAG AAA ACA TTT AAT TAT AAT AAG GCA CAA TCA CAT CAG CTG TTT 
ne Sn Lys Thr Phe Asn Tyr Asn Lys Ala Gin Ser His Gin Leu Phe 
245 250 255 

GCA ATT ATA ATG GAG TGC ATG AAG AAG AAA GAA CTC GTC ACT GTG TTC 
Ala He He Met Glu Cys Met Lys Lys Lys Glu Leu Val Thr Val Phe 
260 265 27U 

AGA ATG GGT TCT GAG GGC CAG CAG GAC ATC GAG ATG GCA ATT TTA ACT 
Sy ser Glu Gly Gin Gin Asp He Glu Met Ala He Leu Thr 

280 285 

GCC CTG CTG AAA GGA ACA AAC GTA TCT GCT CCA GAT CAG CTG AGC TTG 
Sa L^ L^ Lys Gly Thr Asn Val Ser Ala Pro Asp Gin Leu Ser Leu 
295 

GCA CTG GCT TGG AAC CGC GTG GAC ATA GCA CGA AGC CAG ATC TTT GTC 
S Sp Asn Arg Val Asp He Ala Arg Ser Gin He Phe Val 
310 315 32a 

TTT GGG CCC CAC TGG ACG CCC CTG GGA AGC CTG GCA CCC CCG ACQ GAC 
?Se 2?? Pro Ss Trp Thr Pro Leu Gly Ser Leu Ala Pro Pro Thr Asp 
325 330 335 



690 



738 



786 



834 



882 



930 



GTG GGT CTC GTG GTG GAG GGG GGC CCT AAC GTG GTG TCC ATC GTC TT6 
vTl S?J Leu val val Glu Gly Gly Pro Asn Val Val Ser He Val Leu 

^ 180 



1026 



1074 



1122 



1170 



1218 



1266 



1314 



1362 



340 

360 '^^^ 

375 380 

GTG AAT GCT TTG GAG CAA GCG ATG CTA GAT GCT TTA GTC TTA GAT OCT 
?S S Glu Gin Ala Met Leu Asp Ala Leu Val Leu Asp Arg 

390 



405 

TTT CT6 ACC ATT CCG AGO CTG GAG GAG CTT TAT AAC ACA AGA CTG GGT 
Z ?S? ?S ??o Arg Leu Glu Glu Leu Tyr Abh Thr Arg Leu Gly 
420 425 «o 

455 460 

GAG TAG CTC ATG GGA GGA GCC TAC CGC TGC AAC TAC ACT CGG AAA AAC 
Sr LeS Set Gly Gly Ala Tyr Arg Cys Asn Tyr Thr Arg Lys Asn 
470 *75 

TTT CGG ACC CTT TAC AAC AAC TTG TTT GGA CCA AA6 AGO CCT AAA GCT 
S ?Sr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg Pro Lys Ala 
485 



CTT AAA CTT CTG GGA ATG GAA GAT GAT GAG CCT CCA GCT AAA 

iJJ 2S lS Gly Met Glu ASP Asp Glu Pro Pro Ala Lys Gly Lys 



500 



^ ^ Si ^ ^ tJi |s ?s r4 ?s 



520 



GAC CCT GCC GTG AGT CGG TTC CAG TAT CCC TTC CAC GAG CTG ATG GTG 
5S ?S Ser Arg Phe Gin Tyr Pro Phe His Glu Leu Met Val 



535 



550 555 
565 ^'^^ 
580 585 



1410 1458 1506 1554 1602 1650 1698 1746 1794 1842 1890 1938 1986 2034 2082 2130 



TCC CAG GAC TTG GAT AAC AAT TCC AAA GAC TTC GGC CAG CTT GCT TTG 2178 
Ser Gln Asp Leu Asp Asn Asn Ser Lys Asp Phe Gly Gln Leu Ala Leu 
600 605 610 

GAG TTA TTA GAC CAG TCC TAT AAG CAT GAC GAG CAG ATC GCT ATG AAA 2226 
Glu Leu Leu Asp Gln Ser Tyr Lys His Asp Glu Gln Ile Ala Met Lys 
615 620 625 

CTC CTG ACC TAC GAG CTG AAA AAC TGG AGC AAC TCG ACC TGC CTC AAA 2274 
Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ser Thr Cys Leu Lys 
630 635 640 

CTG GCC GTG GCA GCC AAA CAC CGG GAC TTC ATT GCT CAC ACC TGC AGC 2322 
Leu Ala Val Ala Ala Lys His Arg Asp Phe Ile Ala His Thr Cys Ser 
645 650 655 

CAG ATG CTG CTG ACC GAT ATG TGG ATG GGA AGA CTG CGG ATG CGG AAG 2370 
Gln Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg Met Arg Lys 
660 665 670 675 

AAC CCC GGC CTG AAG GTT ATC ATG GGG ATT CTT CTA CCC CCC ACC ATC 2418 
Asn Pro Gly Leu Lys Val Ile Met Gly Ile Leu Leu Pro Pro Thr Ile 
680 685 690 

TTG TTT TTG GAA TTT CGC ACA TAT GAT GAT TTC TCG TAT CAA ACA TCC 2466 
Leu Phe Leu Glu Phe Arg Thr Tyr Asp Asp Phe Ser Tyr Gln Thr Ser 
695 700 705 

AAG GAA AAC GAG GAT GGC AAA GAA AAA GAA GAG GAA AAT ACG GAT GCA 2514 
Lys Glu Asn Glu Asp Gly Lys Glu Lys Glu Glu Glu Asn Thr Asp Ala 
710 715 720 

AAT GCA GAT GCT GGC TCA AGA AAG GGG GAT GAG GAG AAC GAG CAT AAA 2562 
Asn Ala Asp Ala Gly Ser Arg Lys Gly Asp Glu Glu Asn Glu His Lys 
725 730 735 

AAA CAG AGA AGT ATT CCC ATC GGA ACA AAG ATC TGT GAA TTC TAT AAC 2610 
Lys Gln Arg Ser Ile Pro Ile Gly Thr Lys Ile Cys Glu Phe Tyr Asn 
740 745 750 755 

GCG CCC ATT GTC AAG TTC TGG TTT TAC ACA ATA TCA TAC TTG GGC TAC 2658 
Ala Pro Ile Val Lys Phe Trp Phe Tyr Thr Ile Ser Tyr Leu Gly Tyr 
760 765 770 

CTG CTG CTG TTT AAC TAC GTC ATC CTG GTG CGG ATG GAT GGC TGG CCG 2706 
Leu Leu Leu Phe Asn Tyr Val Ile Leu Val Arg Met Asp Gly Trp Pro 
775 780 785 

TCC CTC CAG GAG TGG ATC GTC ATC TCC TAC ATC GTG AGC CTG GCG TTA 2754 
Ser Leu Gln Glu Trp Ile Val Ile Ser Tyr Ile Val Ser Leu Ala Leu 
790 795 800 

GAG AAG ATA CGA GAG ATC CTC ATG TCA GAA CCA GGC AAA CTC AGC CAG 2802 
Glu Lys Ile Arg Glu Ile Leu Met Ser Glu Pro Gly Lys Leu Ser Gln 
805 810 815 

AAA ATC AAA GTT TGG CTT CAG GAG TAC TGG AAC ATC ACA GAT CTC GTG 2850 
Lys Ile Lys Val Trp Leu Gln Glu Tyr Trp Asn Ile Thr Asp Leu Val 
820 825 830 835 

GCC ATT TCC ACA TTC ATG ATT GGA GCA ATT CTT CGC CTA CAG AAC CAG 2898 
Ala Ile Ser Thr Phe Met Ile Gly Ala Ile Leu Arg Leu Gln Asn Gln 
840 845 850 






CCC TAC ATG GGC TAT GGC CGG GTG ATC TAC TGT GTG GAT ATC ATC TTC 
Pro Tyr Met Gly Tyr Gly Arg Val Ile Tyr Cys Val Asp Ile Ile Phe 
855 860 865 

TGG TAC ATC CGT GTC CTG GAC ATC TTT GGT GTC AAC AAG TAT CTG GGG 
Trp Tyr Ile Arg Val Leu Asp Ile Phe Gly Val Asn Lys Tyr Leu Gly 
870 875 880 

CCA TAC GTG ATG ATG ATT GGA AAG ATG ATG ATC GAC ATG CTG TAC TTT 
Pro Tyr Val Met Met Ile Gly Lys Met Met Ile Asp Met Leu Tyr Phe 
885 890 895 

GTG GTC ATC ATG CTG GTC GTG CTC ATG AGT TTC GGA GTA GCC CGT CAA 
Val Val Ile Met Leu Val Val Leu Met Ser Phe Gly Val Ala Arg Gln 
900 905 910 915 

GCC ATT CTG CAT CCA GAG GAG AAG CCC TCT TGG AAA CTG GCC CGA AAC 
Ala Ile Leu His Pro Glu Glu Lys Pro Ser Trp Lys Leu Ala Arg Asn 
920 925 930 

ATC TTC TAC ATG CCC TAC TGG ATG ATC TAT GGA GAG GTG TTT GCA GAC 
Ile Phe Tyr Met Pro Tyr Trp Met Ile Tyr Gly Glu Val Phe Ala Asp 
935 940 945 

CAG ATA GAC CTC TAC GCC ATG GAA ATT AAT CCT CCT TGT GGT GAG AAC 
Gln Ile Asp Leu Tyr Ala Met Glu Ile Asn Pro Pro Cys Gly Glu Asn 
950 955 960 

CTA TAT GAT GAG GAG GGC AAG CGG CTT CCT CCC TGT ATC CCC GGC GCC 
Leu Tyr Asp Glu Glu Gly Lys Arg Leu Pro Pro Cys Ile Pro Gly Ala 
965 970 975 

TGG CTC ACT CCA GCA CTC ATG GCG TGC TAT CTA CTG GTC GCC AAC ATC 
Trp Leu Thr Pro Ala Leu Met Ala Cys Tyr Leu Leu Val Ala Asn Ile 
980 985 990 995 

CTG CTG GTG AAC CTG CTG ATT GCT GTG TTC AAC AAT ACT TTC TTT GAA 
Leu Leu Val Asn Leu Leu Ile Ala Val Phe Asn Asn Thr Phe Phe Glu 
1000 1005 1010 

GTA AAA TCA ATA TCC AAC CAG GTG TGG AAG TTC CAG CGA TAT CAG CTG 
Val Lys Ser Ile Ser Asn Gln Val Trp Lys Phe Gln Arg Tyr Gln Leu 
1015 1020 1025 

ATT ATG ACA TTT CAT GAC AGG CCA GTC CTG CCC CCA CCG ATG ATC ATT 
Ile Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro Met Ile Ile 
1030 1035 1040 

TTA AGC CAC ATC TAC ATC ATC ATT ATG CGT CTC AGC GGC CGC TGC AGG 
Leu Ser His Ile Tyr Ile Ile Ile Met Arg Leu Ser Gly Arg Cys Arg 
1045 1050 1055 

AAA AAG AGA GAA GGG GAC CAA GAG GAA CGG GAT CGT GGA TTG AAG CTC 
Lys Lys Arg Glu Gly Asp Gln Glu Glu Arg Asp Arg Gly Leu Lys Leu 
1060 1065 1070 1075 

TTC CTT AGC GAC GAG GAG CTA AAG AGG CTG CAT GAG TTC GAG GAG CAG 
Phe Leu Ser Asp Glu Glu Leu Lys Arg Leu His Glu Phe Glu Glu Gln 
1080 1085 1090 

TGC GTG CAG GAG CAC TTC CGG GAG AAG GAG GAT GAG CAG CAG TCG TCC 
Cys Val Gln Glu His Phe Arg Glu Lys Glu Asp Glu Gln Gln Ser Ser 
1095 1100 1105 



ACC GAC GAG CGC ATC CGG GTC ACT TCT GAA AGA GTT GAA AAT ATG TCA 
Ser Asp Glu Arg Ile Arg Val Thr Ser Glu Arg Val Glu Asn Met Ser 
1110 1115 1120 

ATG AGG TTG GAA GAA ATC AAT GAA AGA GAA ACT TTT ATG AAA ACT TCC 
Met Arg Leu Glu Glu Ile Asn Glu Arg Glu Thr Phe Met Lys Thr Ser 
1125 1130 1135 

CTG CAG ACT GTT GAC CTT CGA CTT GCT CAG CTA GAA GAA TTA TCT AAC 
Leu Gln Thr Val Asp Leu Arg Leu Ala Gln Leu Glu Glu Leu Ser Asn 
1140 1145 1150 1155 

AGA ATG GTG AAT GCT CTT GAA AAT CTT GCG GGA ATC GAC AGG TCT GAC 
Arg Met Val Asn Ala Leu Glu Asn Leu Ala Gly Ile Asp Arg Ser Asp 
1160 1165 1170 

CTG ATC CAG GCA CGG TCC CGG GCT TCT TCT GAA TGT GAG GCA ACG TAT 
Leu Ile Gln Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu Ala Thr Tyr 
1175 1180 1185 

CTT CTC CGG CAA AGC AGC ATC AAT AGC GCT GAT GGC TAC AGC TTG TAT 
Leu Leu Arg Gln Ser Ser Ile Asn Ser Ala Asp Gly Tyr Ser Leu Tyr 
1190 1195 1200 

CGA TAT CAT TTT AAC GGA GAA GAG TTA TTA TTT GAG GAT ACA TCT CTC 
Arg Tyr His Phe Asn Gly Glu Glu Leu Leu Phe Glu Asp Thr Ser Leu 
1205 1210 1215 

TCC ACG TCA CCA GGG ACA GGA GTC AGG AAA AAA ACC TGT TCC TTC CGT 
Ser Thr Ser Pro Gly Thr Gly Val Arg Lys Lys Thr Cys Ser Phe Arg 
1220 1225 1230 1235 

ATA AAG GAA GAG AAG GAC GTG AAA ACG CAC CTA GTC CCA GAA TGT CAG 
Ile Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro Glu Cys Gln 
1240 1245 1250 

AAC AGT CTT CAC CTT TCA CTG GGC ACA AGC ACA TCA GCA ACC CCA GAT 
Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser Ala Thr Pro Asp 
1255 1260 1265 

GGC AGT CAC CTT GCA GTA GAT GAC TTA AAG AAC GCT GAA GAG TCA AAA 
Gly Ser His Leu Ala Val Asp Asp Leu Lys Asn Ala Glu Glu Ser Lys 
1270 1275 1280 

TTA GGT CCA GAT ATT GGG ATT TCA AAG GAA GAT GAT GAA AGA CAG ACA 
Leu Gly Pro Asp Ile Gly Ile Ser Lys Glu Asp Asp Glu Arg Gln Thr 
1285 1290 1295 

GAC TCT AAA AAA GAA GAA ACT ATT TCC CCA AGT TTA AAT AAA ACA GAT 
Asp Ser Lys Lys Glu Glu Thr Ile Ser Pro Ser Leu Asn Lys Thr Asp 
1300 1305 1310 1315 

GTG ATA CAT GGA CAG GAC AAA TCA GAT GTT CAA AAC ACT CAG CTA ACA 
Val Ile His Gly Gln Asp Lys Ser Asp Val Gln Asn Thr Gln Leu Thr 
1320 1325 1330 

GTG GAA ACG ACA AAT ATA GAA GGC ACT ATT TCC TAT CCC CTG GAA GAA 
Val Glu Thr Thr Asn Ile Glu Gly Thr Ile Ser Tyr Pro Leu Glu Glu 
1335 1340 1345 

ACC AAA ATT ACA CGC TAT TTC CCC GAT GAA ACG ATC AAT GCT TGT AAA 
Thr Lys Ile Thr Arg Tyr Phe Pro Asp Glu Thr Ile Asn Ala Cys Lys 
1350 1355 1360 



ACA ATG AAG TCC AGA AGC TTC GTC TAT TCC CGG GGA AGA AAG CTG GTC 
Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg Lys Leu Val 
1365 1370 1375 

GGT GGG GTT AAC CAG GAT GTA GAG TAC AGT TCA ATC ACG GAC CAG CAA 
Gly Gly Val Asn Gln Asp Val Glu Tyr Ser Ser Ile Thr Asp Gln Gln 
1380 1385 1390 1395 

TTG ACG ACG GAA TGG CAA TGC CAA GTT CAA AAG ATC ACG CGC TCT CAT 
Leu Thr Thr Glu Trp Gln Cys Gln Val Gln Lys Ile Thr Arg Ser His 
1400 1405 1410 

AGC ACA GAT ATT CCT TAC ATT GTG TCG GAA GCT GCA GTG CAA GCT GAG 
Ser Thr Asp Ile Pro Tyr Ile Val Ser Glu Ala Ala Val Gln Ala Glu 
1415 1420 1425 

CAA AAA GAG CAG TTT GCA GAT ATG CAA GAT GAA CAC CAT GTC GCT GAA 
Gln Lys Glu Gln Phe Ala Asp Met Gln Asp Glu His His Val Ala Glu 
1430 1435 1440 

GCA ATT CCT CGA ATC CCT CGC TTG TCC CTA ACC ATT ACT GAC AGA AAT 
Ala Ile Pro Arg Ile Pro Arg Leu Ser Leu Thr Ile Thr Asp Arg Asn 
1445 1450 1455 

GGG ATG GAA AAC TTA CTG TCT GTG AAG CCA GAT CAA ACT TTG GGA TTC 
Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gln Thr Leu Gly Phe 
1460 1465 1470 1475 

CCA TCT CTC AGG TCA AAA AGT TTA CAT GGA CAT CCT AGG AAT GTG AAA 
Pro Ser Leu Arg Ser Lys Ser Leu His Gly His Pro Arg Asn Val Lys 
1480 1485 1490 



TCC ATT CAG GGA AAG TTA GAC AGA TCT GGA CAT GCC AGT AGT GTA AGC 
Ser Ile Gln Gly Lys Leu Asp Arg Ser Gly His Ala Ser Ser Val Ser 
1495 1500 1505 

AGC TTA GTA ATT GTG TCT GGA ATG ACA GCA GAA GAA AAA AAG GTT AAG 
Ser Leu Val Ile Val Ser Gly Met Thr Ala Glu Glu Lys Lys Val Lys 
1510 1515 1520 

AAA GAG AAA GCT TCC ACA GAA ACT GAA TGC T AGTCTGTTTT GTTTCTTTAA 
Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1525 1530 

TTXTTTTTTT TAACAGTCAG AAACCCACTA ATGGGTGTCA TCTTOGCCCA TCCTAAACAC 
ATMTCCAATT TCCTAAAAAC ATTTTCCCTT 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1533 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Tyr Ile Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His 
1 5 10 15 

Leu Met Val Lys Asp Trp Gln Leu Glu Leu Pro Lys Leu Leu Ile Ser 
20 25 30 



Val His Gly Gly Leu Gln Asn Phe Glu Met Gln Pro Lys Leu Lys Gln 
35 40 45 

Val Phe Gly Lys Gly Leu Ile Lys Ala Ala Met Thr Thr Gly Ala Trp 
50 55 60 

Ile Phe Thr Gly Gly Val Ser Thr Gly Val Ile Ser His Val Gly Asp 
65 70 75 80 

Ala Leu Lys Asp His Ser Ser Lys Ser Arg Gly Arg Val Cys Ala Ile 
85 90 95 

Gly Ile Ala Pro Trp Gly Ile Val Glu Asn Lys Glu Asp Leu Val Gly 
100 105 110 

Lys Asp Val Thr Arg Val Tyr Gln Thr Met Ser Asn Pro Leu Ser Lys 
115 120 125 

Leu Ser Val Leu Asn Asn Ser His Thr His Phe Ile Leu Ala Asp Asn 
130 135 140 

Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Leu Leu 
145 150 155 160 

Glu Lys His Ile Ser Leu Gln Lys Ile Asn Thr Arg Leu Gly Gln Gly 
165 170 175 

Val Pro Leu Val Gly Leu Val Val Glu Gly Gly Pro Asn Val Val Ser 
180 185 190 

Ile Val Leu Glu Tyr Leu Gln Glu Glu Pro Pro Ile Pro Val Val Ile 
195 200 205 

Cys Asp Gly Ser Gly Arg Ala Ser Asp Ile Leu Ser Phe Ala His Lys 
210 215 220 

Tyr Cys Glu Glu Gly Gly Ile Ile Asn Glu Ser Leu Arg Glu Gln Leu 
225 230 235 240 

Leu Val Thr Ile Gln Lys Thr Phe Asn Tyr Asn Lys Ala Gln Ser His 
245 250 255 

Gln Leu Phe Ala Ile Ile Met Glu Cys Met Lys Lys Lys Glu Leu Val 
260 265 270 

Thr Val Phe Arg Met Gly Ser Glu Gly Gln Gln Asp Ile Glu Met Ala 
275 280 285 

Ile Leu Thr Ala Leu Leu Lys Gly Thr Asn Val Ser Ala Pro Asp Gln 
290 295 300 

Leu Ser Leu Ala Leu Ala Trp Asn Arg Val Asp Ile Ala Arg Ser Gln 
305 310 315 320 

Ile Phe Val Phe Gly Pro His Trp Thr Pro Leu Gly Ser Leu Ala Pro 
325 330 335 

Pro Thr Asp Ser Lys Ala Thr Glu Lys Glu Lys Lys Pro Pro Met Ala 
340 345 350 

Thr Thr Lys Gly Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Gly Lys 
355 360 365 



Val Lys Glu Glu Val Glu Glu Glu Thr Asp Pro Arg Lys Ile Glu Leu 
370 375 380 

Leu Asn Trp Val Asn Ala Leu Glu Gln Ala Met Leu Asp Ala Leu Val 
385 390 395 400 

Leu Asp Arg Val Asp Phe Val Lys Leu Leu Ile Glu Asn Gly Val Asn 
405 410 415 

Met Gln His Phe Leu Thr Ile Pro Arg Leu Glu Glu Leu Tyr Asn Thr 
420 425 430 

Arg Leu Gly Pro Pro Asn Thr Leu His Leu Leu Val Arg Asp Val Lys 
435 440 445 

Lys Ser Asn Leu Pro Pro Asp Tyr His Ile Ser Leu Ile Asp Ile Gly 
450 455 460 

Leu Val Leu Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr 
465 470 475 480 

Arg Lys Asn Phe Arg Thr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg 
485 490 495 

Pro Lys Ala Leu Lys Leu Leu Gly Met Glu Asp Asp Glu Pro Pro Ala 
500 505 510 

Lys Gly Lys Lys Lys Lys Lys Lys Lys Lys Glu Glu Glu Ile Asp Ile 
515 520 525 

Asp Val Asp Asp Pro Ala Val Ser Arg Phe Gln Tyr Pro Phe His Glu 
530 535 540 

Leu Met Val Trp Ala Val Leu Met Lys Arg Gln Lys Met Ala Val Phe 
545 550 555 560 

Leu Trp Gln Arg Gly Glu Glu Ser Met Ala Lys Ala Leu Val Ala Cys 
565 570 575 

Lys Leu Tyr Lys Ala Met Ala His Glu Ser Ser Glu Ser Asp Leu Val 
580 585 590 

Asp Asp Ile Ser Gln Asp Leu Asp Asn Asn Ser Lys Asp Phe Gly Gln 
595 600 605 

Leu Ala Leu Glu Leu Leu Asp Gln Ser Tyr Lys His Asp Glu Gln Ile 
610 615 620 

Ala Met Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ser Thr 
625 630 635 640 

Cys Leu Lys Leu Ala Val Ala Ala Lys His Arg Asp Phe Ile Ala His 
645 650 655 

Thr Cys Ser Gln Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg 
660 665 670 

Met Arg Lys Asn Pro Gly Leu Lys Val Ile Met Gly Ile Leu Leu Pro 
675 680 685 

Pro Thr Ile Leu Phe Leu Glu Phe Arg Thr Tyr Asp Asp Phe Ser Tyr 
690 695 700 



Gln Thr Ser Lys Glu Asn Glu Asp Gly Lys Glu Lys Glu Glu Glu Asn 
705 710 715 720 

Thr Asp Ala Asn Ala Asp Ala Gly Ser Arg Lys Gly Asp Glu Glu Asn 
725 730 735 

Glu His Lys Lys Gln Arg Ser Ile Pro Ile Gly Thr Lys Ile Cys Glu 
740 745 750 

Phe Tyr Asn Ala Pro Ile Val Lys Phe Trp Phe Tyr Thr Ile Ser Tyr 
755 760 765 

Leu Gly Tyr Leu Leu Leu Phe Asn Tyr Val Ile Leu Val Arg Met Asp 
770 775 780 

Gly Trp Pro Ser Leu Gln Glu Trp Ile Val Ile Ser Tyr Ile Val Ser 
785 790 795 800 

Leu Ala Leu Glu Lys Ile Arg Glu Ile Leu Met Ser Glu Pro Gly Lys 
805 810 815 

Leu Ser Gln Lys Ile Lys Val Trp Leu Gln Glu Tyr Trp Asn Ile Thr 
820 825 830 

Asp Leu Val Ala Ile Ser Thr Phe Met Ile Gly Ala Ile Leu Arg Leu 
835 840 845 

Gln Asn Gln Pro Tyr Met Gly Tyr Gly Arg Val Ile Tyr Cys Val Asp 
850 855 860 

Ile Ile Phe Trp Tyr Ile Arg Val Leu Asp Ile Phe Gly Val Asn Lys 
865 870 875 880 

Tyr Leu Gly Pro Tyr Val Met Met Ile Gly Lys Met Met Ile Asp Met 
885 890 895 

Leu Tyr Phe Val Val Ile Met Leu Val Val Leu Met Ser Phe Gly Val 
900 905 910 

Ala Arg Gln Ala Ile Leu His Pro Glu Glu Lys Pro Ser Trp Lys Leu 
915 920 925 

Ala Arg Asn Ile Phe Tyr Met Pro Tyr Trp Met Ile Tyr Gly Glu Val 
930 935 940 

Phe Ala Asp Gln Ile Asp Leu Tyr Ala Met Glu Ile Asn Pro Pro Cys 
945 950 955 960 

Gly Glu Asn Leu Tyr Asp Glu Glu Gly Lys Arg Leu Pro Pro Cys Ile 
965 970 975 

Pro Gly Ala Trp Leu Thr Pro Ala Leu Met Ala Cys Tyr Leu Leu Val 
980 985 990 

Ala Asn Ile Leu Leu Val Asn Leu Leu Ile Ala Val Phe Asn Asn Thr 
995 1000 1005 

Phe Phe Glu Val Lys Ser Ile Ser Asn Gln Val Trp Lys Phe Gln Arg 
1010 1015 1020 

Tyr Gln Leu Ile Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro 
1025 1030 1035 1040 



Met Ile Ile Leu Ser His Ile Tyr Ile Ile Ile Met Arg Leu Ser Gly 
1045 1050 1055 

Arg Cys Arg Lys Lys Arg Glu Gly Asp Gln Glu Glu Arg Asp Arg Gly 
1060 1065 1070 

Leu Lys Leu Phe Leu Ser Asp Glu Glu Leu Lys Arg Leu His Glu Phe 
1075 1080 1085 

Glu Glu Gln Cys Val Gln Glu His Phe Arg Glu Lys Glu Asp Glu Gln 
1090 1095 1100 

Gln Ser Ser Ser Asp Glu Arg Ile Arg Val Thr Ser Glu Arg Val Glu 
1105 1110 1115 1120 

Asn Met Ser Met Arg Leu Glu Glu Ile Asn Glu Arg Glu Thr Phe Met 
1125 1130 1135 

Lys Thr Ser Leu Gln Thr Val Asp Leu Arg Leu Ala Gln Leu Glu Glu 
1140 1145 1150 

Leu Ser Asn Arg Met Val Asn Ala Leu Glu Asn Leu Ala Gly Ile Asp 
1155 1160 1165 

Arg Ser Asp Leu Ile Gln Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu 
1170 1175 1180 

Ala Thr Tyr Leu Leu Arg Gln Ser Ser Ile Asn Ser Ala Asp Gly Tyr 
1185 1190 1195 1200 

Ser Leu Tyr Arg Tyr His Phe Asn Gly Glu Glu Leu Leu Phe Glu Asp 
1205 1210 1215 

Thr Ser Leu Ser Thr Ser Pro Gly Thr Gly Val Arg Lys Lys Thr Cys 
1220 1225 1230 

Ser Phe Arg Ile Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro 
1235 1240 1245 

Glu Cys Gln Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser Ala 
1250 1255 1260 

Thr Pro Asp Gly Ser His Leu Ala Val Asp Asp Leu Lys Asn Ala Glu 
1265 1270 1275 1280 

Glu Ser Lys Leu Gly Pro Asp Ile Gly Ile Ser Lys Glu Asp Asp Glu 
1285 1290 1295 

Arg Gln Thr Asp Ser Lys Lys Glu Glu Thr Ile Ser Pro Ser Leu Asn 
1300 1305 1310 

Lys Thr Asp Val Ile His Gly Gln Asp Lys Ser Asp Val Gln Asn Thr 
1315 1320 1325 

Gln Leu Thr Val Glu Thr Thr Asn Ile Glu Gly Thr Ile Ser Tyr Pro 
1330 1335 1340 

Leu Glu Glu Thr Lys Ile Thr Arg Tyr Phe Pro Asp Glu Thr Ile Asn 
1345 1350 1355 1360 

Ala Cys Lys Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg 
1365 1370 1375 



Lys Leu Val Gly Gly Val Asn Gln Asp Val Glu Tyr Ser Ser Ile Thr 
1380 1385 1390 

Asp Gln Gln Leu Thr Thr Glu Trp Gln Cys Gln Val Gln Lys Ile Thr 
1395 1400 1405 

Arg Ser His Ser Thr Asp Ile Pro Tyr Ile Val Ser Glu Ala Ala Val 
1410 1415 1420 

Gln Ala Glu Gln Lys Glu Gln Phe Ala Asp Met Gln Asp Glu His His 
1425 1430 1435 1440 

Val Ala Glu Ala Ile Pro Arg Ile Pro Arg Leu Ser Leu Thr Ile Thr 
1445 1450 1455 

Asp Arg Asn Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gln Thr 
1460 1465 1470 

Leu Gly Phe Pro Ser Leu Arg Ser Lys Ser Leu His Gly His Pro Arg 
1475 1480 1485 

Asn Val Lys Ser Ile Gln Gly Lys Leu Asp Arg Ser Gly His Ala Ser 
1490 1495 1500 

Ser Val Ser Ser Leu Val Ile Val Ser Gly Met Thr Ala Glu Glu Lys 
1505 1510 1515 1520 

Lys Val Lys Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1525 1530 



